Inter-Speaker Scaling of Poly-Segmental Formant Ensembles
نویسنده
چکیده
A linear-scaling approach is described for handling inter-speaker variations. The approach is motivated (i) by the similarity commonly observed amongst the formant-patterns resulting from different speakers’ productions of the same utterance, and (ii) by the fact that there are linear-scaling properties associated with similarity. In practical terms, linear transformations of the formant-patterns amongst different speakers are sought and interpreted as a set of scaling relations; the formant patterns are obtained from an ensemble of phonetically-varying segments. Using multi-speaker formant data on Australian English “hello”, the ensemble scales are found to explain the bulk of inter-speaker differences. The approach is independent of segmental structure; it uses only linear regression as its main computational machinery.
منابع مشابه
Linear scaling of vowel-formant ensembles (VFEs) in consonantal contexts
There are familiar terms such as ‘‘contour’’ and ‘‘trajectory’’ to refer to a vowel formant frequency as a function defined on the time axis, but there is no readily understood term for the analogous idea of how a formant behaves on the ‘‘vowel axis’’. For this we introduce the concept of a vowel-formant ensemble (VFE) as the set of values realized for a given formant (e.g., F2) in going from v...
متن کاملLinear scaling effects of phonetic
Systematic effects of phonetic context on vowel-formant transitions are uncovered using the hypothesis (Broad & Clermont 2002) that, for a given speaker and a given formant, the relative spacing between vowels is invariant, and unaffected by consonantal context or by location within a syllable. The assumed invariance implies that, within and between contexts, inter-vowel spacings will be geomet...
متن کاملSpeaker conversion in ARX-based source-
A speaker conversion framework for formant synthesis is proposed. With this framework, given a small set of a target speaker’s utterances, segmental features of an original speech can be converted to those of the given speaker. Unlike other speaker conversion frameworks, further voice quality modification can also be applied to the converted speech with conventional formant modification techniq...
متن کاملAcoustic-articulatory evaluation of the upper vowel-formant region and its presumed speaker-specific potency
We present some evidence indicating that phonetic distinctiveness and speaker individuality, are indeed manifested in vowels' vocal-tract shapes estimated from the lower and the upper formant-frequencies, respectively. The methodology developed to demonstrate this dichotomy, rst implicates Schroeder's [8] acoustic-articulatory model which can be coerced to yield, on a per-vowel and a per-speake...
متن کاملProsodic and segmental factors in foreign-accent conversion
We propose a signal processing method that transforms foreign-accented speech to resemble its native-accented counterpart. The problem is closely related to voice conversion, except that our method seeks to preserve the organic properties of the foreign speaker’s voice; i.e., only those features which cue foreign-accentedness are to be transformed. Our method operates at two levels: prosodic an...
متن کامل